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TECHNICAL FIELD 



This invention relates to accessing dynamic World Wide Web ("Web") 
pages on a Web server by converting Universal Resource Locator (URL) addresses 
from one type to another. 

BACKGROUND 

There are more than a billion documents available on the World Wide Web 
("Web") over the Internet and this number continues to rapidly increase. These 
documents ("Web pages") are stored as files on Web servers. Each of these Web 
10 pages has a unique Web address. These address are also called Uniform Resoi 
y i i Locators (URLs) or Universal Resource Locators (URLs). URLs are more fully 
explained in RFC 1738 "Uniform Resource Locators (URL) Berners-Lee, 
Masinter & McCahill." Static Web Pages and Static Addressing 
« 14 An Internet device, such as a computer using a Web browser, typically 

15 accesses a specific Web page by providing its unique Web address (e.g., a URL). 
^ 16 That Web page is a static file stored on a Web server. The file is simply copied 
17 without change to the requesting Internet device. Every device accessing the static 
is file sees the same results. The stored file remains unchanged until an authorized 

19 user actively modifies the file. These types of Web pages are typically called 

20 "static." A typical URL for a static Web page looks like this: 

21 

22 http: //domain, name, com/pagename. htm 

23 

24 I The "http://" is the value of the scheme field and it identifies the protocol 

25 scheme being used to transmit over the Internet. For the Web, the protocol scheme 
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typically is HyperText Transfer Protocol (HTTP). The "domain.name.com" is the 

value of the hostname field and it identifies the domain (or the Web server) that 

hosts the Web page addressed by the static URL. The actual format of this field 

depends upon the domain name conventions observed. Typically, the format 

includes a domain name and an extension (e.g., microsoft.com). 

The "pagename" is the value of the path field and/or the file-name field. It 

may include a path to the specific Web page. It includes the file name of the 

specific Web page. The ".htm" is the value of the file-extension field and it 

identifies the format of the file. In this example, the format of the static file is the 

ip 10 most common format for a Web page: HyperText Markup Language (HTML). 

M 11 
m 

E3 12 Dynamic Web Pages and Dynamic Addressing 

13 The opposite of a static Web page is a "dynamic" Web page. A dynamic 
Web page is one that is created the moment the page is accessed and it is usually 
created based upon data in a database. Unlike a static Web page, a dynamic Web 

^ 16 page that a viewer sees is not stored intact on a Web server. Instead, a dynamic 

n Web page is generated anew each time it is accessed. 

s A dynamic Web page is generated based upon a stored file containing 

19 instructions and an associated database. Therefore, each instance of a generated 

20 dynamic Web page may be different from a previously generated page using the 

21 same address. There are many different implementations of dynamic Web pages. 

22 The implementation differs from each other in the set of instructions used in the 

23 stored file on the Web server and the type of database accessed. Examples of such 

24 implementations include Active Server Pages (ASP) by the Microsoft Corporation 

25 and "JavaBeans" Activation Framework (JAF). 
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A typical URL for a dynamic Web page may look like this: 

httpJ/domainMame. com/pagename. asp ?parm 1 =valJ&parm2 =val2 

This example uses an ASP implementation. The protocol scheme, hostname, path, 
and filename fields are the same as those fields in the static URL. However, there 
are fields in a dynamic address that are different from fields in a static address. 

The extension ".asp" is a value of a file-extension field and identifies the 
format of the dynamic-page-generation instructions. The extension ".asp" 
indicates that the page is formatted as an Active Server Page (ASP). The _"?" 
symbol is a signal that the URL points to a dynamic page and it separates the 
portion of the dynamic URL referring to a specific file and the portion of the URL 
containing parameters. 

The "parml- ' and "parm2=" elements identify the names of categorized 
parameter. The values of these parameters are used to generate the dynamic Web 
page, "vail" and "val2" are the values of the parameters. The values are typically 
used to access items in a database. A parameter consists of a parameter name and 
its associated value. There can be a series of many parameters. The "&" symbol 
separates each parameter for the other parameters. 

Web Search Engines and Spiders 

No central bibliographic authority exists to catalog the information found 
on the tens of millions of Web sites on the Internet. Generally, two basic 
approaches are available for finding the proverbial needle in this immense Web 
haystack: a subject directory or a search engine. 
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Subject directories, such as "Snap" and "MSN", catalog Web pages and 
organize them by subject. Each Web page is manually (or automatically) analyzed 
and categorized. Users can browse through the various categories and 
subcategories in the subject directories to find a Web site on a particular topic. 
Typically, Web pages are categorized and added to the directory by* professional 
Web searchers or by user submissions. 

A search engine provides a searchable database of indexed keywords. A 
search engine examines Web pages for specified keywords and returns a list of the 
Web pages where the keywords were found. Although search engines are general 
! 10 class of programs, the term is often used to specifically describe systems like "Alta 
yl n Vista" and "Excite" that enable users to search for Web pages on the Web. 

12 A search engine includes two main parts: index searcher and index 

O 13 generator. An index searcher includes a database of indexing keywords of Web 
H pages and logic for searching that database. An index generator includes a 
is "spider" for gathering Web pages and an "indexer" for generating an index into 
4 i6 those pages. 

n Typically, a search engine works by sending out the spider to fetch as many 

pages as possible. The indexer then reads these pages and creates an index based 

19 on the words contained in each page. Each search engine typically uses a 

20 proprietary algorithm to create its indices such that, ideally, only meaningful 

21 results are returned for each query. 

22 Spiders are sometimes referred to as "Web-spiders", "robots", "Web 

23 wanderers", "crawlers", "Web-crawler", "ants", or "worms." These alternative 

24 names refer to programs that have the same basic functionality to visit Web sites 

25 by requesting documents from them. 
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A spider will "crawl" a Web page by following links found on the page. 
Normal Web browsers (e.g., "Internet Explorer") are not spiders, because they are 
operated by humans, and don't automatically retrieve referenced documents. 

Provided with a page by a spider, an indexer parses the document and 
inserts selected keywords into the database with references back to the original 
6 location of the source page. How this is accomplished depends on the indexer. 
Some indexers index the titles of the Web pages or the first few paragraphs. Some 
parse the entire contents and index all words. Some parse the meta-tag or other 
9 special hidden tags. 
O 10 Meta-tags are special HTML tags that provide information about a Web 

page. Unlike normal HTML tags, meta-tags do not affect how the page is 
12 displayed. Instead, they provide information such as who created the page, how 
n often it is updated, what the page is about, and which keywords represent the 
u page's content. Many search engines use this information when building their 

15 indices. 

16 When visiting a Web site, most spiders will check a file called the 
fobots.txt" file. This file informs the spider whether the spider is authorized to 

is search the site and if so authorized, which pages on the site to retrieve. 

19 Single-destination Web sites called "portals" are often a combination of a 

20 "subject directory" and a "search engine." These portals include a search engine 

21 (with its spider and indexer) or are closely associated with a third-party search 

22 engine. These portals often include an organized and customized subject 

23 directory. 
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The Invisible Web 

The Invisible Web is made up of information stored in Web databases. 
Unlike pages on the visible Web, information in databases is generally 
inaccessible to the spiders to compile search engines. 

Search engines typically index the Web by visiting Web pages and 
indexing their content. In particular, the spiders use the links found on pages to 
find new Web pages. The links include static URLs. 

8 Most spiders tend to ignore the content of a dynamic Web address and thus, 

9 the contents of the referenced dynamic Web page. These dynamic Web pages are 

10 often ignored because the format of their dynamic URL is different from the URL 
format of a static Web page. Spiders are often specifically programmed to ignore 

S~ 12 dynamic addresses because of the complexity of navigating through dynamic 
g 13 pages. 

The information found in the databases of dynamic Web sites is not 
indexed by search engines. Therefore, these dynamic Web sites are not found by 
\ i6 those using search engines to search the Web. This huge, unmapped region of the 
Internet is called the "Invisible Web." 
is E-commerce sites with on-line shopping catalogs typically use dynamic 

19 Web pages because their databased inventory is changing constantly. These sites 

20 wish to be indexed by search engines because to help bring users to their site. 
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Conventional Solution 

To allow search engines to index their sites, dynamic sites (such as e- 
commerce sites with inventory) periodically generate "snapshots" of their dynamic 
Web pages. These snapshots are static Web pages generated from corresponding 
dynamic Web pages, which are generated at a moment in time. 

However, there are several significant drawbacks to the "snapshot" 
approach. In a short period of time, the snapshots no longer represent the current 
inventory. Periodically generating the snapshots consumes processing and storage 
resources. 

Although the snapshot approach does allow a search engine to index the 
dynamic Web site, the URLs stored by the search engine are static URLs. 
Therefore, the search engine ultimately directs a user to the snapshot pages rather 
than to the preferable dynamic pages. Dynamic sites would prefer users to use 
their dynamic page to take full advantage of the dynamic nature of the site. If the 
users are using the snapshot pages, then the information seen by the user may not 
be accurate. 

SUMMARY 

A Web address converter converts dynamic Web pages in a manner that 
overcomes the drawbacks of the "snapshot" approach. The Web address converter 
helps dynamic Web sites get the attention of the spiders of Internet search engines. 
With the Web address converter, requests from Web browsers using static 
addresses access corresponding dynamic Web pages and requests from search 
engines generate an instance of a Web page having links with static addresses 
pointing to corresponding dynamic Web pages. 
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The Web address converter performs both Dynamic-to-Static (D-to-S) 
address conversion and Static-to-Dynamic (S-to-D) address conversion. D-to-S 
address conversion is done when generating a spider-friendly main page for a 
spider of a search engine to crawl. S-to-D address conversion is used when a 
browser uses a static address to access a corresponding dynamic Web page. The 
static address that the browser uses was originally created when the spider- friendly 
main page was generated. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic illustration of an exemplary computer network (such 
as the Internet) that includes a Web site hosting dynamic Web pages. The site 
includes a converter implementing an embodiment of the Web address converter 
for dynamic Web pages. 

Fig. 2 is a bi-directional address-conversion-mapping diagram showing: 

• from top to bottom, an exemplary implementation of 
dynamic-to-static (D-to-S) address conversion; and 

• from bottom to top, an exemplary implementation of static-to- 
dynamic (S-to-D) address conversion. 

Fig. 3 is flowchart showing a process implementing the Web address 
converter for dynamic Web pages. 

Fig. 4 is flowchart showing a process implementing the Web address 
converter for dynamic Web pages. 

Fig. 5 is an example of a computer capable of implementing the Web 
address converter for dynamic Web pages. 
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DETAILED DESCRIPTION 

The following description sets forth a specific embodiment of the Web 
address converter for dynamic Web pages that incorporates elements recited in the 
appended claims. This embodiment is described with specificity in order to meet 
statutory written description, enablement, and best-mode requirements. However, 
the description itself is not intended to limit the scope of this patent. Rather, the 
inventor has contemplated that the claimed Web address converter might also be 
embodied in other ways/in conjunction with other present or future technologies. 

9 The following description sets forth a Web address converter for dynamic 

10 Web pages that enable the use of static addresses to access corresponding dynamic 
Web pages. The converter provides static to dynamic URL mapping for certain 

jLj 12 incoming static addresses. The converter also provides for the dynamic generation 
of an instance of a Web page containing links to dynamic Web pages, but the links 
H include static addresses pointing to the dynamic Web pages. 

r/t i6 I Dynamic Web Sites and the Internet 

17 Fig. 1 shows a computer network that includes computers linked via 

is Internet 20. Client 22 using a Web browser (e.g., "Internet Explorer" by the 

19 Microsoft Corporation) may access the Internet via one or more Internet Service 

20 Providers (ISPs) 24. 

21 Fig. 1 shows a search engine (with a spider) 26 connected to the Internet 

22 20. The search engine 26 includes a Web server (or collection of Web servers) 28 

23 and a database 29. The database 29 stores indexed keywords associated with 

24 source Web pages. 
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The search engine 26 contains the searchable database 29 of keywords that 
are associated with Uniform Resource Locators (URLs) pointing to Web pages. 
Typically, a search engine uses a "spider" program module to fetch as many 
documents (i.e., Web pages) as possible. An "indexer" of the search engine reads 
these documents and creates a database based on the words contained in each 
document. 

Fig. 1 also shows a static Web site 30 connected to the Internet 20. The 
static Web site 30 includes a Web server (or collection of Web servers) 32. The 
static Web site 30 contains static Web pages stored in a file located in the file 
system of the Web server 32, and stored in non-volatile memory, such as disk 
drives 34a-34d. 

Disk drives 34a-34d illustrate a hierarchical path to a file containing a static 
Web page on drive 34d. Suppose that drive 34a is named "root"; drive 34b is 
named "subl"; drive 34c is named "sub2"; drive 34d is named "sub2"; and a file 
36 named "file.htm". The path to file 36 would be "/root/sub l/sub2/sub3/file.htm" 

Although the path is shown as multiple disk drives 34a-34d, the path 
typically will be both file directories and subdirectories on the same file system of 
the Web server. Alternatively, the path may be across multiple Web servers. 

Fig. 1 shows a dynamic Web site 40 connected to the Internet 20. The 
dynamic Web site 40 includes a Web server (or collection of Web servers) 42. The 
dynamic Web pages are dynamically generated by the server 42 based upon data 
stored in database 44. 

The dynamic Web site 40 also includes a Web address converter 46. The 
converter 46 implements the exemplary embodiment of the Web address 
converter. The converter 46 may be a filter designed to examine incoming 
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requests of the dynamic Web site. Although the converter 46 is shown as a filter 
that is separate from the server 42, the converter may be part of the server. 

A filter determines whether incoming requests meet given requirements. If 
so, then it performs specified actions and/or modifies the request before passing it 
along to the Web server 42. A filter is typically installed on port 80 (TCP:80) to 
capture Web-related traffic. 

Although a filter implementing the converter 46 may be hardware, it is 
software in the exemplary embodiment. Specifically, the converter 46 is an ISAPI 
(Internet Server Application Programming Interface) filter. ISAPI is an easy-to- 
use, high-performance interface and API for back-end applications for "Internet 
Information Server" (IIS) by the Microsoft Corporation. An ISAPI filter is a 
replaceable dynamic link library (DLL) that the server calls on every HTTP 
(Hypertext Transfer Protocol) request. When the filter is loaded, it tells the server 
what sort of notifications in which it is interested. After that, whenever the 
selected events occur, the filter is called and given the opportunity to process that 
event. 

Alternatively, the filter implementing the converter 46 may be a software 
program designed for that purpose, it may be part of a Web serving application, 
and it may be part of the operating system. Also, the functional components of the 
filter may be distributed over multiple Web servers in one site or multiple sites. 
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Web Address Mapping 

2 Fig. 2 shows an example of Web address mapping for converting Web 

3 addresses from one type to another type that is implemented by Web address 
converter 46. This address conversion is done so that requests from Web browsers 
using static addresses can access dynamic Web pages. Also, it is done so that 
requests from spiders of search engines generate an instance of a spider-friendly 

7 Web page having links with static addresses pointing to dynamic Web pages. 

8 There are two types of Web address conversion: Static-to-dynamic (S-to- 
D) and dynamic-to-static (D-to-S). Each is the reverse of the other. S-to-D 

O io address conversion is performed to redirect requests from Web browsers using 
ii static addresses to corresponding dynamic Web pages. D-to-S address conversion 
J 12 is performed to generate an instance of a spider- friendly Web page having links 

13 with static addresses pointing to corresponding dynamic Web pages. 

14 Fig. 2 shows the address conversion mapping generally at 100. Reading 
from top to bottom shows D-to-S address conversion as represented by "down" 

J i6 arrow 102. Reading from bottom to top shows S-to-D address conversion as 
3 n represented by "up" arrow 104. Unless specifically stated otherwise, the addition 
of any field or value in one conversion direction corresponds to the removal of the 
same field or value in the other conversion direction, and vice versa. 

20 Block 110 shows a generic example of a dynamic address at 112. The 

21 dynamic address points to a dynamic Web page. The dynamic address 1 12 shown 

22 in Fig. 2 is 

23 

24 | http : //hostname/path. asp?parmx=valx&parmlast=vallast 

25 
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Block 120 shows the fields removed from the dynamic address 112 to 
convert the dynamic address of 112 to a static address that points to the same 
dynamic Web page. The following fields are removed: 

• a field-separator symbol ("/") at 122; 

• a dynamic file extension (".asp") and a file-parameter-separator 
symbol ("?") at 124; 

• one or more parameter names ("parmx") and a symbol ("=") that 
equates each of them to value at 126; 

• a file-parameter-separator symbol ("&") used to separate each 

10 parameter from each other parameter at 128 (a parameter includes a 

1 1 parameter name and its value); and 



^ 12 * a last parameter name ("parmlast") in the series of parameters and 

p[. 13 the symbol ("=") that equates it to its value at 130. 

u If there is only one parameter, then "parmlast" will be the only parameter name 

IP 15 and there will be no "&" symbol. 

; a 
t : 

.j i6 Block 140 shows the fields that are found in both the source Web address 

n n and the converted Web address. Generally, these common fields are the basis for 
converting from one address type to another. These common fields map from the 

19 original address to specific positions in the converted address. 

20 The common fields include "scheme" field at 142 that identifies the 

21 protocol scheme being used to transmit the request for a Web page and the 

22 protocol scheme to be used to send the requested Web page. Since the exemplary 

23 embodiment is used for converting Web addresses, the typical protocol scheme is 

24 | HTTP; therefore, the value of the scheme field 142 normally is "http://". 

25 
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Other common fields shown in block 140 include a "hostname" field at 
144, a "path" at 146, a "valx" at 148, and a "vallast" at 150. 

The hostname field 144 contains a name of a Web server (or Web site) 
hosting the dynamic Web page. The name in the hostname field at 144 is 
"hostname" and it is derived from the static Web address 112. 

The path field 146 contains a name of a hierarchical path used to access the 
file containing instructions for generating the dynamic Web page. The name in the 
path field at 146 is "path" and is derived from the static Web address 112. Similar 
to the hierarchical path illustrated by the disk drives 34a-34d of Fig. 1, the path in 
the path field 146 may be one or more levels. For example, the path may be 
"path"; "root/path"; or "root/subpathl/subpath2/path". 

The valx value field 148 may be one or more parameter values. Each valx 
value field contains a value associated with a parameter name (like "parmx" 126). 
The "=" symbol equates each parameter with a specific value. For example, 
"parmx" has a value of "valx". 

The vallast value field 150 a value associated with the last parameter name 
(like "parmlast" 130). The "=" symbol equates the last parameter name with a 
specific value. For example, "parmlast" has a value of "vallast". If there is only 
one parameter in the dynamic address 112, then "vallast" will be the only value 
and there will be no valx values. 
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Block 160 shows the fields added to the common fields of block 140 to 
convert the dynamic address of 112 to a static address that points to the same 
dynamic Web page. The following fields are added: 

• a field-separator symbol ("/") and an alias-indicator flag ("flag") at 
162; 

• one or more field-separator symbol ("/") at 164; 

• a static file extension (".htm") that indicates that the format of the 
file is static although the file's format is actually dynamic. 

The alias-indicator flag 162 is inserted into the resulting static address so 
io that the exemplary converter will recognize it as a static address that does not 
4 1 1 point to an actual static Web page. Rather, the address with the alias-indicator flag 

12 points to a dynamic Web page. 
:? 2 13 Block 170 shows an example of a converted static address at 172. This 

H static address points to the same dynamic Web page that the dynamic address of 
is 112 does. The static address 172 shown in Fig. 2 is 



h. s 16 



17 



http : //hostname/f lag/path/valx/vallast . htm 

18 

19 The arrows between the fields of block 160 and the static address 172 of 

20 block 170 illustrate the relative mapping of the fields into the static address. The 

21 fields 162-166 are inserted between the common fields 142-150 of block 140 to 

22 form the static address 172. 

23 In Fig. 2, the common fields 142-150 of block 140 are shown underlined in 

24 the dynamic address 112 and the static address 172. The underlining makes the 

25 common fields easier to locate in the originating dynamic address 112 and the 
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resulting static address 172. The underlining also highlights the existence and the 
relative location of each field in the both addresses. Furthermore, it distinguishes 
the common fields from other fields that are specific to only one type of address. 

The arrows between blocks 110 and 130 illustrate which fields are removed 

5 from the dynamic address 112 and where the fields are removed. Likewise, the 

6 arrows between blocks 160 and 170 illustrate which fields are added to the 

7 common fields to form the static address 172 and where the fields are added. 
The above description of Fig. 2 is given reading from top to bottom to 

illustration D-to-S address conversion. However, as mentioned above, the 
io addition of any field or value for the above-described D-to-S address conversion 
til n corresponds to the removal of the same field or value in the S-to-D address 
%4 12 conversion. Likewise, the removal of any field or value for the above-described 
C3 i3 D-to-S address conversion corresponds to the addition of the same field or value 
14 | in the S-to-D address conversion. 

15 

16 I Alternative Parameter Value Mapping 
iy 17 In the exemplary embodiment of the Web address converter, the converter 

places the parameter values (such as "valx" or "vallast" ) based upon their relative 

19 position in the original dynamic address. For example, "valx" is before "vallast". 

20 When such a static address is converted back (re-mapped) to a dynamic address, 

21 the converter assumes the parameter fields based upon the relative positioning of 

22 the parameter values in the static address. For example, "parmlast" is assumed to 

23 be equal to "vallast" because "vallast" is the last field in the static address. 



24 



25 



Lee & Hayes. PLLC 



16 



0427001329 MSJ-487US.PA T.APP.F/NAL. DOC 




In an alternative embodiment, the parameter associations may be specified 
in the static address so that values are re-mapped back specific parameters. This 
may occur in a variety of ways. As one example, assume the following original 
dynamic address: 

http : / /hostname/path. asp?parmA=valA&parmB=valB&parmC=valC 
s | This dynamic address may be converted into this static address: 

9 

10 | http : //hostname/f lag/path/parmA=valA/parmB=valB/parmC=valC .htm 
u 

12 An alternative encoding may be: 

j 13 | http : //hostname/f lag/path/parmA/valA/parmB/valB/parmC/valC. htm 

14 

is Any encoding that includes both the parameter name and the value in such 

16 a way that the original pairing can be extracted may be used. When such an 

} 1? alternative embodiment of the Web address converter maps this static address 

is backs to its original dynamic address, it will not assume how to map the values 

19 back to the dynamic address. Rather, it will examine the static address to see 

20 exactly how to map the values to which parameters because the parameters 

21 themselves are specified in the static address. 

22 
23 
24 
25 
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Method of Web Address Analysis and Conversion 

Fig. 3 shows a process for handling a request for dynamic Web pages that 
is implemented by the Web address converter. At 200, a request to access a Web 
page is received by a Web server or a filter (such as the filter 46 in Fig. 1). For the 

5 exemplary implementation, the filter receives the request. Such a request may be 

6 a GET request in HTTP. 
If the request includes a dynamic address, then the filter passes it back to 

the Web server for normal dynamic Web page invocation. Otherwise, the request 
includes a static Web address. At 202, the filter parses the request to extract the 
io static address included therein. 

At 204, the static address is examined to determine if it includes an "alias- 
12 indicator flag." The alias-indicator flag is any set of unique (within the Web 
n server) alphanumeric characters that may be used within a static address to 

14 identify the address as being an "alias address." For example, the flag may be 

15 "root", "flag", "static", or "alias". The field containing the alias-indicator flag is 

16 shown at 162 in Fig. 2. An "alias address" is a static address that points to a 

17 dynamic Web page rather than a static one. There are two sources of an "alias 
address." 

The first source is where an alias address is a resulting static address from 
the exemplary D-to-S address conversion. During the D-to-S address conversion, 

21 the alias-indicator flag is inserted into the resulting static address in the manner 

22 shown at blocks 160 and 170 in Fig. 2. 

23 The second source of an alias address is an address stored in a file on the 

24 Web server. That file is called the "robots.txt" file. The address stored in this file 

25 includes the alias-indicator flag. 
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This "robots. txt" file informs a spider whether the spider is authorized to 
search the site and if so authorized, which pages on the site to retrieve. In the 
exemplary embodiment, the "robots.txt" file authorizes a spider to access only one 
page and that page is a "spider-friendly main" Web page (i.e., "spider-friendly 
index" page). The spider-friendly main page is generated upon access. The 
generation of the spider-friendly main page is illustrated in Fig. 4, which is 
discussed below. 

Therefore, if the static address parsed from the request is an "alias address", 
then the process will proceed to block 208 for further analysis of the static address. 

10 Otherwise, the static address is a conventional static address and it points to an 

1 1 actual static Web page. In this case, the process will proceed to block 206 to allow 
□ 12 access to an actual static Web page as normal. The filter will hand the static 

address to the Web server so that the Web server can access and send the 
14 referenced static Web page, 
p 15 At 208 of Fig. 3, the static address (which is now known to be an alias 

ify 

y 16 address) is further examined to determine if the address includes parameter values. 
Parameter values are fields after the path in the static address. Referring again to 
Fig. 2, "/valx/vallast.htm" are parameter values after "path" in static address 172 
of block 170. 

20 If the static address does not have parameter values, then the static address 

points to the spider-friendly main page. Therefore, the process proceeds to block 
210 and the generation of the spider- friendly main page illustrated in Fig. 4, which 

23 is discussed below. If the static address includes parameter values, then the static 

24 address points to dynamic Web page, then the process proceeds to block 212. 

25 
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At 212, the static address is converted into its mapped dynamic address in 
the manner illustrated from bottom to top in Fig. 2 and described above as the S- 
to-D address conversion. This produces a dynamic address that points to an 
existing files storing instructions from dynamically generating the dynamic Web 
page. 

At 214, the dynamic Web page referenced by the converted dynamic 
address is invoked. When generating the web page, meta-tags may also be 
inserted. Such meta-tags help the search engines properly index each individual 
web page. The meta-tags may be derived from a variety of sources. For example, 
the meta-tags may be dynamically generated from the content of the web m page._ 
Alternatively, the meta-tags may be retrieved from the database. In yet a further 
alternative, the meta-tags may be from a variety of sources, such as a combination 
of meta-tags that are dynamically generated and meta-tags that are retrieved from 
a database or provided by some other component. 

The purpose of inserting these meta-tags is to enhance the chances of the 
specific page being found during a search of a search engine. The use of meta- 
tags increases the breadth of keywords used on the page. This increases the 
likelihood of the specific page being found during a search on a search engine. At 
216, the dynamic Web page is sent to the requester. Dynamic Web pages have 
instructions and the dynamic addresses have parameters and values. These 
instructions, the parameters, the values are used to generate (i.e., invoke) a Web 
page based upon information in the database. At 218, on-going Web page access 
continues as normal 
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In blocks 204 and 208, the filter makes three determinations in this order: 

• Does the request include a normal static address to access actual 
static Web pages? 

• If not, then does the request include a "special" static address? A 
static address is "special" if it includes an alias-indicator flag. 

• If the static address is "special", then does the address point to the 

7 spider- friendly main page or does it point to somewhere else? 

8 The exemplary converter makes these three determinations by detecting an alias- 
indicator flag (like "flag" of block 162 of Fig. 2) and parameter values (like "valx" 

io and "vallast" of blocks 148 and 150 of Fie. 2). 

i i I Alternatively, the determinations may be made by employing multiple flags 

O 12 in the static address. Each flag helps the filter make each determination. In 
D 13 addition, the spider- friendly main page may have a defined address that contains 
H no flags itself, but the filter will recognize the defined address, 
is Fig. 4 shows more of the process for handling a request for dynamic Web 

i6 pages that is implemented by the Web address converter. Specifically, Fig. 4 
O n shows the generation of the spider- friendly main page, which is represented by 
is block 210 of Fig. 3. 

19 At 250, the filter dynamically generates an initial main page. This initial 

20 main page includes links containing dynamic addresses pointing to dynamic Web 

21 pages. These dynamic Web pages may represent the current inventory for an e- 

22 commerce site. 

23 At 252, meta-tags are inserted into the initial main page. The filter has a 

24 defined set of meta-tags that are inserted each time a main page is generated. 

25 Alternatively, the filter dynamically generates a set of meta-tags that are created 
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based upon current information in the database. Further, the filter may directly 
retrieve the meta-tags from the database. 

Like the meta-tags generated for a specific dynamic Web page of block 
214, the purpose of inserting these meta-tags into the main page is to enhance the 
chances of the page being found during a search of a search engine. The use of 
meta-tags increases the breadth of keywords used on the page. This increases the 
likelihood of the main page being found during a search on a search engine. 

At 254, the dynamic addresses in these links are converted into static 
addresses in the manner described above as D-to-S address conversion and shown 
q 10 from top to bottom in Fig. 2. 

At 256, the filter sends the spider- friendly main page to the requester. The 
12 requester is typically a browser or a spider. At 258, on-going Web page access 
o 13 continues as normal. 

£ 14 

15 Exemplary Computer 

16 Fig. 5 shows a computer that is an example of a Web server that is used 
0 n with the exemplary implementation of the Web address converter for dynamic 

18 Web pages. This computer may be a client running a Web browser, a Web server 

19 running a search engine, a Web server hosting a static Web site, a Web server 

20 hosting a dynamic Web site, or any computer within a communications network 

21 (such as the Internet) 

22 As shown in Fig. 5, computer 330 includes one or more processors or 

23 processing units 332, a system memory 334, and a bus 336 that couples various 

24 system components including the system memory 334 to processors 332. Bus 336 

25 represents one or more of any of several types of bus structures, including a 
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memory bus or memory controller, a peripheral bus, an accelerated graphics port, 
and a processor or local bus using any of a variety of bus architectures. 

The system memory includes read only memory (ROM) 338 and random 
access memory (RAM) 340. A basic input/output system (BIOS) 342, containing 
the basic routines that help to transfer information between elements within 
computer 330, such as during start-up, is stored in ROM 338. 

Computer 330 further includes a hard disk drive 344 for reading from and 
writing to a hard disk, not shown, a magnetic disk drive 346 for reading from and 
writing to a removable magnetic disk 348, and an optical disk drive 350 for 
g io reading from or writing to a removable optical disk 352 such as a CD ROM, DVD 
mn ROM or other optical media. The hard disk drive 344, magnetic disk drive 346 
q 12 and optical disk drive 350 are each connected to bus 336 by one or more interfaces 
354. 



0'13 



s u The drives and their associated computer-readable media provide 
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nonvolatile storage of computer readable instructions, data structures, program 
16 modules, and other data for computer 330. Although the exemplary environment 
O 17 described herein employs a hard disk, a removable magnetic disk 348 and a 
i8 removable optical disk 352, it should be appreciated by those skilled in the art that 
other types of computer readable media which can store data that is accessible by a 

20 computer, such as magnetic cassettes, flash memory cards, digital video disks, 

21 random access memories (RAMs), read only memories (ROM), and the like, may 

22 also be used in the exemplary operating environment. 

23 A number of program modules may be stored on the hard disk, magnetic 

24 disk 348, optical disk 352, ROM 338, or RAM 340, including an operating system 

25 358, one or more application programs 360 (such as a Web browser), other 
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program modules 362, and program data 364. A user may enter commands and 
information into computer 330 through input devices such as keyboard 366 and 
pointing device 368. Other input devices (not shown) may include a microphone, 
joystick, game pad, satellite dish, scanner, or the like. These and other input 

5 devices are connected to the processing unit 332 through an interface 370 that is 

6 coupled to bus 336. 

7 A monitor 372 or other type of display device is also connected to bus 336 
via an interface, such as a video adapter 374. In addition to the monitor, personal 
computers typically include other peripheral output devices (not shown) such as 

io speakers and printers. 
Ul n Computer 330 can operate in a networked environment using logical 

Q 12 connections to one or more remote computers, such as a Web server 382. Web 

13 server 382 typically includes many or all of the elements described above relative 

14 to computer 330. In addition, a Web database 384 may be connected to the Web 

15 server 382. 

16 A logical connection that is not depicted in Fig. 5 is a local area network 

17 (LAN) via network interface 386 and a general wide area network (WAN) via a 
modem 378. Such networking environments are commonplace in offices, 

19 enterprise- wide computer networks, intranets, and the Internet. 

20 Depicted in Fig. 5, is a specific implementation of a WAN via the Internet. 
Over the Internet, computer 330 typically includes a modem 378 or other means 

22 for establishing communications over the Internet 380. Modem 378, which may 

23 be internal or external, is connected to bus 336 via interface 356. 

24 In a networked environment, program modules depicted relative to the 

25 personal computer 330, or portions thereof, may be stored in the remote memory 



21 



Lee & Hayes, PLLC 



24 



0427001329 MS1-487US.PA T.APP.FINAL. DOC 



storage device. It will be appreciated that the network connections shown and 
. described are exemplary and other means of establishing a communications link 
between the computers may be used. 

Example: "Acmeto vs. store" Dynamic Web Site 

The following example is provided to help illustrate how the exemplary 
implementation of the Web address converter might be used. 

Suppose that a fictional company called "Acme Toy Store" has an e- 
commerce Web site where it sells toys and playthings! Acme's Web address is 
10 "http://acmetoys. store". Like a "brick-and-mortar" store. Acme's e-commerce site 
J|j 11 has an inventory that is constantly changing as existing items are sold and shipped 
12 and new items arrive. Therefore, Acme's Web site has dynamically accessible and 
q 13 updateable database for tracking inventory. Acme can track its inventory in real 
14 time and users of Acme's site can order "in-stock" items in real time, 
is Acme would like for Internet search engines to be a source of inexpensive 

16 advertisement for its toys. Acme would like for the spiders of search engines to 
crawl its Web site and index its products. That way, a search engine may direct a 
is user (who is searching for that hot new toy from overseas) to Acme's site. 
Preferably, the search engine may direct the user to the particular page on the site 
referring to that hot new toy. Acme would like users to go directly to their 

21 dynamic Web pages so that the user can view the current product information. 

22 Acme implements the exemplary embodiment of the Web address converter 

23 in an IS API filter on its Web site. Acme modifies is "robots.txt" file on its Web 

24 server to authorize and direct spiders to crawl a spider-friendly main page at 

25 "http://acmetoys.store/flag/index.htm". The alias-indicator flag is "flag." 
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Subsequently, a spider from a search engine arrives to crawl Acme's site. 
The spider examines the "robots.txt" file and proceeds to access the spider- 
friendly main page. 

Referring to the flow charts shown in Figs. 3 and 4, the filter receives (at 

5 200 of Fig. 3) a request from the spider to access a Web page. The filter parses (at 

6 202) the request and pulls out this static address: 
"http ://acmetoys . store/flag/index .htm" . 

Upon examination (at 204) of this address, the filter determines that it 
includes "flag", which is the alias-indicator flag. Upon further examination (at 

10 208) of this address, the filter determines that the address does not include any 

11 parameter values. 

12 Therefore, the filter generates an initial main page having links with 
n dynamic addresses (at 250 of Fig. 4). The dynamic addresses point to dynamic 
14 Web pages describing the available toys and providing a means for purchasing 

r 15 such toys. The filter inserts (at 252) a set of meta-tags into the initial main page. 

s t 

si i6 The meta-tags have many keywords related to toys and playthings. 
I n D-to-S conversion (at 254) is performed on the dynamic address so that the 

links include static addresses. These static addresses will include the alias- 
indicator flag and one or more parameter values. 
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Below is an example of Web address conversion using a dynamic Web 
page on Acme's site that describes a fictional product called "megawheel." The 
following example of D-to-S address conversion is done in accordance with such 
D-to-S address conversion illustrated in Fig. 2 and described above: 

Megawheel dynamic address: 

http : / /acmetoys . store/ toys/wheeled . asp?age=4 -or -under &name=megawheel 

The following is a table identifying the fields illustrated in Fig. 2 and described 
above. The field names are italicized. The table also gives the specific values of - 
these fields: 



Common Fields 


Dynamic Fields Removed 


Static Fields Added 


scheme 


http:// 


field separator 


I 


field separator 


1 


hostname 


acmetoys.store 


dynamic file extension 


".asp" 


flag 


flag 


path 


toys/wheeled 


file-parameter-separator 


? 


field separator 


/ 


valx 


4-or-under 


parameter 


age 


field separator 


/ 


vallast 


megawheel 


parameter separator 


& 


field separator 


/ 






last parameter 


name 


static file extension 


u .htm" 



Resulting example static address (after D-to-S address conversion): 



Megawheel static address: 

http: //acmetoys . store/f lag/toys/wheeled/4 -or -under /megawheel . htm 
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After the D-to-S address conversion of the address in the links in the initial 
main page, the page becomes the spider-friendly main page. The filter sends (at 
256) the spider- friendly main page to the original requester, which was the spider 
of the search engine. 

The indexer of the search engine indexes the spider-friendly main page of 

6 Acme's Web site. Keywords from the spider- friendly main page are stored in the 

7 search engine's database. 
Subsequently, a user searches for a toy called "megawheel" on the search 

engine. It discovers a link to Acme's Web site. That link includes the megawheel 
~ io static address (as shown above). The user clicks on that link and is whisked away 
!» n to Acme's site. 

12 Again referring to the flow charts shown in Figs. 3 and 4, the filter receives 

n (at 200 of Fig. 3) a request from the browser of the user to access a Web page. 
The filter parses (at 202) the request and pulls out the megawheel static address: 
j£ i 5 "ht1p://acmetoys.store/flag/toys/wheeled/4-or-under/megawheel.htm". 

m 

%r 16 Upon examination (at 204) of this address, the filter determines that it 

n includes "flag", which is the alias-indicator flag. Upon further examination (at 

is 208) of this address, the filter determines that the address includes parameter 

19 values. Specifically, the address includes "4-or-under" and "megawheel" 

20 parameter values. The filter performs S-to-D address conversion (at 212) on the 

21 static address so that browser can be redirected to the megawheel dynamic Web 

22 | page. 

23 
24 
25 
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S-to-D address conversion is done in accordance with such S-to-D 
conversion illustrated in Fig. 2 and described above. The static address is 
converted into megawheel dynamic address: 

"htlp : //acmetoys . store/toy 

Note that this is the megawheel dynamic address and the same address that was 
the source for the D-to-S address conversion performed when generating the 
spider- friendly main page. 

After this S-to-D address conversion, the desired megawheel dynamic Web 
page is invoked and megawheel-related meta tags are inserted into the page (at 
214). The page is sent to the browser of the user (at 216), Therefore, the_user 
views the current dynamic Web page for the "megawheel" product. The user found 
the megawheel dynamic Web page using a corresponding megawheel static 
address stored in a searchable database of a search engine. 

Conclusion 

Using the exemplary implementation of the Web address converter, spiders 
can fetch and indexers can index the dynamic content of dynamic Web sites. 
Furthermore, browsers using a static address to a dynamic Web page of a dynamic 
Web site can access the referenced dynamic Web page, rather then a stale, static 
copy. 

Although the address converter has been described in language specific to 
structural features and/or methodological steps, it is to be understood that the web 
address converter defined in the appended claims is not necessarily limited to the 
specific features or steps described. Rather, the specific features and steps are 
disclosed as preferred forms of implementing the claimed web address converter. 
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